最近,已经开发了方法以准确地预测其在特定任务上的深神经网络(DNN)的测试性能,给定其底层拓扑结构的统计数据。然而,进一步利用这一新发现的实际应用的洞察力由于时间和记忆的高计算成本,因此是棘手的。在这项工作中,我们定义了一类新的拓扑功能,可以准确地表征学习的进度,同时在运行时迅速计算。此外,我们所提出的拓扑功能易于配备反向化,这意味着它们可以在最终训练中纳入其中。我们的新开发的DNN实际拓扑表征允许额外的应用程序。我们首先显示我们可以预测没有测试集的DNN的性能,而无需高性能计算。我们还证明了我们对DNN的拓扑表征在估计任务相似性方面是有效的。最后,我们表明我们可以通过主动限制DNN的拓扑结构来诱导DNN中的学习。这使得在元学框架中收缩了DNN的基础结构来开辟了新的途径。
translated by 谷歌翻译
随着超维数据的大数据分析的最新激增,对机器学习应用程序的降低技术的兴趣重新引起了人们的兴趣。为了使这些方法提高绩效提高并了解基础数据,需要确定适当的指标。此步骤通常被忽略,通常会选择指标,而无需考虑数据的基本几何形状。在本文中,我们提出了一种将弹性指标纳入T分布的随机邻居嵌入(T-SNE)和均匀的歧管近似和投影(UMAP)的方法。我们将方法应用于功能数据,该功能数据以旋转,参数化和比例为特征。如果这些属性被忽略,它们可能会导致不正确的分析和分类性能差。通过我们的方法,我们证明了三个基准数据集(MPEG-7,CAR数据集和Themoor的平面数据集)的形状识别任务的提高,我们分别获得了0.77、0.95和1.00 F1分数。
translated by 谷歌翻译
本文基于Loeffler离散余弦变换(DCT)算法引入了矩阵参数化方法。结果,提出了一类新的八点DCT近似值,能够统一文献中几个八点DCT近似的数学形式主义。帕累托效率的DCT近似是通过多准则优化获得的,其中考虑了计算复杂性,接近性和编码性能。有效的近似及其缩放的16和32点版本嵌入了图像和视频编码器中,包括类似JPEG的编解码器以及H.264/AVC和H.265/HEVC标准。将结果与未修饰的标准编解码器进行比较。在Xilinx VLX240T FPGA上映射并实现了有效的近似值,并评估了面积,速度和功耗。
translated by 谷歌翻译
量子哈密顿学习和量子吉布斯采样的双重任务与物理和化学中的许多重要问题有关。在低温方案中,这些任务的算法通常会遭受施状能力,例如因样本或时间复杂性差而遭受。为了解决此类韧性,我们将量子自然梯度下降的概括引入了参数化的混合状态,并提供了稳健的一阶近似算法,即量子 - 固定镜下降。我们使用信息几何学和量子计量学的工具证明了双重任务的数据样本效率,因此首次将经典Fisher效率的开创性结果推广到变异量子算法。我们的方法扩展了以前样品有效的技术,以允许模型选择的灵活性,包括基于量子汉密尔顿的量子模型,包括基于量子的模型,这些模型可能会规避棘手的时间复杂性。我们的一阶算法是使用经典镜下降二元性的新型量子概括得出的。两种结果都需要特殊的度量选择,即Bogoliubov-Kubo-Mori度量。为了从数值上测试我们提出的算法,我们将它们的性能与现有基准进行了关于横向场ISING模型的量子Gibbs采样任务的现有基准。最后,我们提出了一种初始化策略,利用几何局部性来建模状态的序列(例如量子 - 故事过程)的序列。我们从经验上证明了它在实际和想象的时间演化的经验上,同时定义了更广泛的潜在应用。
translated by 谷歌翻译
最近的证据表明,SARS-COV-2是2020年导致全球大流行病的病毒,主要经由室内环境中的空气机气溶胶传播。在评估和控制建筑物的室内空气质量(IAQ)时,这需要新颖的策略。 IAQ通常可以通过通风和/或政策来控制以调节人建筑物相互作用。然而,在建筑物中,占用者使用其他方式使用房间,可能并不明显哪种措施或对措施的组合导致成本和能源有效的解决方案,确保整个建筑物的良好IAQ。因此,在本文中,我们介绍了一种基于代理的模拟器,亚拟合,旨在帮助通过估计足够的房间尺寸,通风参数和测试政策的效果来帮助创造新的或适应现有建筑物,同时考虑到IAQ的结果复杂的人建筑物相互作用模式。最近公开的气溶胶模型适于计算每个房间中的时间依赖性二氧化碳($ CO_2 $)和病毒量子浓度,每天吸入$ CO_2 $和病毒量子,作为生理反应的衡量标准。由于其模块化架构,Archabm对气溶胶模型和建筑布局具有灵活性,这允许实现进一步的模型,任何数字和房间,代理和操作的行动,反映人建筑物交互模式。我们提供了一个基于我们研究中心采用的真正平面计划和工作时间表的用例。本研究表明,先进的仿真工具如何有助于改善建筑物的IAQ,从而确保健康的室内环境。
translated by 谷歌翻译
The recent increase in public and academic interest in preserving biodiversity has led to the growth of the field of conservation technology. This field involves designing and constructing tools that utilize technology to aid in the conservation of wildlife. In this article, we will use case studies to demonstrate the importance of designing conservation tools with human-wildlife interaction in mind and provide a framework for creating successful tools. These case studies include a range of complexities, from simple cat collars to machine learning and game theory methodologies. Our goal is to introduce and inform current and future researchers in the field of conservation technology and provide references for educating the next generation of conservation technologists. Conservation technology not only has the potential to benefit biodiversity but also has broader impacts on fields such as sustainability and environmental protection. By using innovative technologies to address conservation challenges, we can find more effective and efficient solutions to protect and preserve our planet's resources.
translated by 谷歌翻译
We present the interpretable meta neural ordinary differential equation (iMODE) method to rapidly learn generalizable (i.e., not parameter-specific) dynamics from trajectories of multiple dynamical systems that vary in their physical parameters. The iMODE method learns meta-knowledge, the functional variations of the force field of dynamical system instances without knowing the physical parameters, by adopting a bi-level optimization framework: an outer level capturing the common force field form among studied dynamical system instances and an inner level adapting to individual system instances. A priori physical knowledge can be conveniently embedded in the neural network architecture as inductive bias, such as conservative force field and Euclidean symmetry. With the learned meta-knowledge, iMODE can model an unseen system within seconds, and inversely reveal knowledge on the physical parameters of a system, or as a Neural Gauge to "measure" the physical parameters of an unseen system with observed trajectories. We test the validity of the iMODE method on bistable, double pendulum, Van der Pol, Slinky, and reaction-diffusion systems.
translated by 谷歌翻译
While the brain connectivity network can inform the understanding and diagnosis of developmental dyslexia, its cause-effect relationships have not yet enough been examined. Employing electroencephalography signals and band-limited white noise stimulus at 4.8 Hz (prosodic-syllabic frequency), we measure the phase Granger causalities among channels to identify differences between dyslexic learners and controls, thereby proposing a method to calculate directional connectivity. As causal relationships run in both directions, we explore three scenarios, namely channels' activity as sources, as sinks, and in total. Our proposed method can be used for both classification and exploratory analysis. In all scenarios, we find confirmation of the established right-lateralized Theta sampling network anomaly, in line with the temporal sampling framework's assumption of oscillatory differences in the Theta and Gamma bands. Further, we show that this anomaly primarily occurs in the causal relationships of channels acting as sinks, where it is significantly more pronounced than when only total activity is observed. In the sink scenario, our classifier obtains 0.84 and 0.88 accuracy and 0.87 and 0.93 AUC for the Theta and Gamma bands, respectively.
translated by 谷歌翻译
Variational autoencoders model high-dimensional data by positing low-dimensional latent variables that are mapped through a flexible distribution parametrized by a neural network. Unfortunately, variational autoencoders often suffer from posterior collapse: the posterior of the latent variables is equal to its prior, rendering the variational autoencoder useless as a means to produce meaningful representations. Existing approaches to posterior collapse often attribute it to the use of neural networks or optimization issues due to variational approximation. In this paper, we consider posterior collapse as a problem of latent variable non-identifiability. We prove that the posterior collapses if and only if the latent variables are non-identifiable in the generative model. This fact implies that posterior collapse is not a phenomenon specific to the use of flexible distributions or approximate inference. Rather, it can occur in classical probabilistic models even with exact inference, which we also demonstrate. Based on these results, we propose a class of latent-identifiable variational autoencoders, deep generative models which enforce identifiability without sacrificing flexibility. This model class resolves the problem of latent variable non-identifiability by leveraging bijective Brenier maps and parameterizing them with input convex neural networks, without special variational inference objectives or optimization tricks. Across synthetic and real datasets, latent-identifiable variational autoencoders outperform existing methods in mitigating posterior collapse and providing meaningful representations of the data.
translated by 谷歌翻译
There are multiple scales of abstraction from which we can describe the same image, depending on whether we are focusing on fine-grained details or a more global attribute of the image. In brain mapping, learning to automatically parse images to build representations of both small-scale features (e.g., the presence of cells or blood vessels) and global properties of an image (e.g., which brain region the image comes from) is a crucial and open challenge. However, most existing datasets and benchmarks for neuroanatomy consider only a single downstream task at a time. To bridge this gap, we introduce a new dataset, annotations, and multiple downstream tasks that provide diverse ways to readout information about brain structure and architecture from the same image. Our multi-task neuroimaging benchmark (MTNeuro) is built on volumetric, micrometer-resolution X-ray microtomography images spanning a large thalamocortical section of mouse brain, encompassing multiple cortical and subcortical regions. We generated a number of different prediction challenges and evaluated several supervised and self-supervised models for brain-region prediction and pixel-level semantic segmentation of microstructures. Our experiments not only highlight the rich heterogeneity of this dataset, but also provide insights into how self-supervised approaches can be used to learn representations that capture multiple attributes of a single image and perform well on a variety of downstream tasks. Datasets, code, and pre-trained baseline models are provided at: https://mtneuro.github.io/ .
translated by 谷歌翻译